A Novel Analytical Approach for Lip Synchronization
نویسندگان
چکیده
We present a novel approach for Lip synchronization by analyzing the relationship between a person’s speech signal and data extracted from his/her lip movements. To model the speech we use a nonlinear-time-varying sum of AM-FM signals each of which models a single formant frequency. The model is then realized using Taylor series expansions such that a closed form formula is achieved which shows the relationship between the speech amplitudes and instantaneous frequencies w.r.t lips varying width and height. Based on the obtained formula, lips movements data are employed to generate a semi-speech signal which is then correlated with the original speech over a span of delays. From the resultant correlation, the delay between the two signals is estimated, hence Lip Sync is achieved. The approach is applied to practical speech examples and the obtained results support the correctness and consistency of our proposed approach. The developed method can estimate delays around 0.1 second at low SNRs and 0.04 second at high SNRs.
منابع مشابه
A Survey – Audio and Video Synchronization
The audio and video Synchronization is extremely necessary. The synchronization loss between image and sound continues to disturb observers and irritate telecasters. The demand is to assure synchronization without adjusting content at the same time as still retaining price low. The objective of the synchronization is to line up both the audio and video signals that are processed individually. T...
متن کاملLinear matrix inequality approach for synchronization of chaotic fuzzy cellular neural networks with discrete and unbounded distributed delays based on sampled-data control
In this paper, linear matrix inequality (LMI) approach for synchronization of chaotic fuzzy cellular neural networks (FCNNs) with discrete and unbounded distributed delays based on sampled-data controlis investigated. Lyapunov-Krasovskii functional combining with the input delay approach as well as the free-weighting matrix approach are employed to derive several sufficient criteria in terms of...
متن کاملPerformance Enhancement in Lip Synchronization Using MFCC Parameters
Many multimedia applications and entertainment industry products like games, cartoons and film dubbing require speech driven face animation and audio-video synchronization. Only Automatic Speech Recognition system (ASR) does not give good results in noisy environment. Audio Visual Speech Recognition system plays vital role in such harsh environment as it uses both – audio and visual – informati...
متن کاملA Lip Localization Based Visual Feature Extraction Method
This paper presents a lip localization based visual feature extraction method to segment lip region from image or video in real time. Lip localization and tracking is useful in many applications such as lip reading, lip synchronization, visual speech recognition, facial animation etc. To synchronize lip movements with input audio we need to first segment lip region from input image or video fra...
متن کاملHybrid Control to Approach Chaos Synchronization of Uncertain DUFFING Oscillator Systems with External Disturbance
This paper proposes a hybrid control scheme for the synchronization of two chaotic Duffing oscillator system, subject to uncertainties and external disturbances. The novelty of this scheme is that the Linear Quadratic Regulation (LQR) control, Sliding Mode (SM) control and Gaussian Radial basis Function Neural Network (GRBFNN) control are combined to chaos synchronization with respect to extern...
متن کامل